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Abstract. A fundamental problem in the study of phylogenetic net¬ 
works is to determine whether or not a given phylogenetic network con¬ 
tains a given phylogenetic tree. We develop a quadratic-time algorithm 
for this problem for binary nearly-stable phylogenetic networks. We also 
show that the number of reticulations in a reticulation visible or nearly 
stable phylogenetic network is bounded from above by a fnnction linear 
in the nnmber of taxa. 


1 Introduction 

Genetic material can be transferred between organisms by hybridization, recom¬ 
bination and horizontal gene transfer besides traditional reproduction. Recent 
studies in comparative genomics suggest that these “lateral” processes are a 
driving force in evolution which shapes the genome of a species I3I11I16) . Ac¬ 
cordingly, phylogenetic networks have commonly been used to model reticulate 
evolutionary histories of species mm- A plethora of methods for reconstruct¬ 
ing reticulate evolutionary histories of species and related algorithmic issues have 
extensively been studied over the past two decades |hlbll:jll3ll4ll71 . 

A phylogenetic network is an acyclic digraph with a set X of labeled leaves 
(that is, vertices of outdegree zero) and a root (having indegree zero). The leaves 
are in one-to-one correspondence with a collection of taxa under study, whereas 
the unique root represents their least common ancestor. Vertices with indegree 
one represent speciation events. Vertices of indegree at least two represent an 
evolutionary process by which genetic material was horizontally transferred from 
one species to another. 

A fundamental question in the study of phylogenetic networks is to determine 
whether a tree is displayed by a phylogenetic network over the same set of taxa 
(in a sense we define precisely below). This problem is called the tree containment 
problem [S]. Answering this question is indeed useful to validate and justify a 
phylogenetic network model by testing whether it displays existing phytogenies 
over a set of taxa under study. 

The problem is N P-complete in general [5] , even on the more restricted class 
of tree-sibling time-consistent regular networks [7]. Although great effort has been 








devoted to the study of that problem, it has been shown to be polynomial-time 
solvable only for a couple of interesting classes of phylogenetic networks, namely, 
normal networks and tree-child networks [7]. Determining the complexity of the 
tree containment problem for a class of phylogenetic networks that properly 
contains tree-child networks, particularly those with the so-called reticulation- 
visibility property, is an open problem m- 

In this paper, we study the tree containment problem for nearly stable phy¬ 
logenetic networks (defined in the next section), which generalize normal and 
tree-child networks. Recombination histories of viruses, hybridization histories 
of plants, and histories of horizontal gene transfers reported in literature often 
satisfy the property that defines those networks |8I1()] . Our key results include: 

(i) the number of reticulations in a reticulation-visible or nearly stable phyloge¬ 
netic network is linearly bounded from above in terms of the number of taxa; 
and (ii) the tree containment problem for nearly stable phylogenetic networks 
can be solved in quadratic time. Omitted proofs and details will appear in the 
extended version. 


2 Concepts and Notions 

A (phylogenetic) network on a set X of taxa is a directed acyclic graph with a 
single root (a vertex with indegree 0) which satisfies the following properties: (i) 
its leaves (vertices with outdegree 0) are in one-to-one correspondence with the 
taxa in X\ (ii) there are no vertices with both indegree one and outdegree one; 
and (iii) there is a path from the root to any other vertex. We identify each leaf 
with the taxon corresponding to it and refer to the directed edges (tail, head) 
as branches. 

In a network, reticulation vertices (or simply reticulations) are vertices with 
indegree at least two and outdegree one; tree vertices are vertices with indegree 
one and outdegree at least two. A branch is a tree branch if it ends at a tree 
vertex; it is called a reticulation branch otherwise. 

A network is binary if its root, leaves and the other vertices have degree 2, 
1 and 3, respectively. A phylogenetic tree is simply a binary network without 
reticulations. 

For a binary network N, we shall use rjy to denote the root of N. Let x and 
y be vertices in N. We say that a; is a parent of y and y is a child of x if (x, y) 
is a branch. More generally, we say that x is an ancestor of y and equivalently y 
is a descendant of x if there is a directed path from x to y. A vertex x in A^ is a 
stable ancestor of a vertex v if it belongs to all directed paths from rjv to v. We 
say that x is stable if there exists a leaf £ such that x is a stable ancestor of £. 

Proposition 1. Let N be a binary network. The following facts hold. 

(1) A vertex is stable if it has a stable tree child. 

(2) A reticulation is stable if and only if its unique child is a stable tree vertex. 

(3) If a tree vertex is stable, then its children cannot both be reticulations. 



A network is a tree-child network if every vertex has a child that is a tree 
vertex [5]. It can be proved that a network is a tree-child network if and only if 
every vertex is stable. It is reticulation-visible if all its reticulations are stable [5] . 
It is nearly stable if for every vertex, either that vertex is stable or its parents 
are. 

Contracting a branch {u, v) means replacing it with a single vertex w in such 
a way that all neighbors of u and v become neighbors of w. Given a binary 
phylogenetic tree T and a binary network N, we say that N displays T if there 
is a spanning subtree T' of N that is a subdivision of T, i.e. T' has the same 
vertex set as N and T can be obtained from T' by contracting all branches in T' 
incident with the vertices with outdegree I and indegree I, all branches incident 
with the “dummy leaves” (leaves in T' that correspond to tree vertices in TV), 
and all branches incident with a vertex of indegree 0 and outdegree 1. Figure]^ 
shows an example of a phylogenetic network N and a tree that is displayed in 
N. 
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Fig. 1. (A) A phylogenetic network. (B) A spanning snbtree of N obtained after the 
reticulation branch between the parents of c and h is removed. (C) A tree displayed in 
N throngh the subtree in (B). 


In this work, we study the tree containment problem (TCP), which is that of 
determining whether a phylogenetic tree is displayed by a network or not. 

3 How Many Reticulations in a Network? 

An arbitrary network with n leaves can have a very large number of reticulations. 
To analyze the time complexity of an algorithm designed for solving a network 
problem, we need to bound the size of the network by a function of n. 

Removing a reticulation branch from each reticulation in a binary network 
N yields a spanning subtree T'. All leaves in N are still leaves in T', but T' 
may additionally contain some “dummy leaves” that correspond to tree vertices 
whose outgoing branches have both been removed. The following lemma says 
that it is always possible to remove proper reticulation branches so as to obtain 
a tree without dummy leaves. 

Lemma 1. Let N be a binary reticulation-visible phylogenetic network. We can 
determine which reticulation branch to remove at each reticulation so that the 
tree obtained after removing the selected branches contains no dummy leaves. 


Proof. Let T be a tree obtained from N by removing exactly one reticulation 
branch incident to each reticulation. In order for T not to contain any dummy 
leaves, we need to guarantee that the reticulation branches to be removed are 
incident with different tree vertices. In other words, the branches to be removed 
form a matching that covers every reticulation in N. Since N has the reticulation- 
visibility property, the parents of each reticulation are both tree vertices (Propo- 
sition[^. Such a set of reticulation branches exists and can be found by applying 
Hall’s Theorem to a bipartite graph with tree vertices and reticulations as vertex 
sets and reticulation branches as edges. Since each reticulation is the head of two 
reticulation branches and each tree vertex is the tail of at most two reticulation 
branches, there exists a matching that covers all the reticulations (see a result 
of N. Alon on page 429 in P). □ 
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Fig. 2. Illustration of the different cases in the proof of Theorem P A. Definition of 
cross and non-cross branches removed from a path. B. The branch (x, y) is a non-cross 
branch removed from a path. Assume that a cross branch ( 2 ', z) has been removed from 
a reticulation 2 inside the segment from x and y, where 2 ' is not shown, and two cross 
branches have also been removed from two tree vertices H and t 2 between 2 and y. 
C. Some cross branches must have been removed from their tails located between the 
heads of two non-cross branches that are removed from a path (in this case, between y 
and y'). D. If two cross branches have been removed from two reticulations in a path, 
then the upper reticulation {vj here) is not stable. 


Theorem 1. Let N be a binary reticulation-visible phylogenetic network with n 
leaves. Then N has at most 4(n — I) retieulations. 

Proof. Assume N contains m reticulations. By Lemma P we can obtain a tree 
T without dummy leaves by removing m reticulation branches from N. Since N 
is binary, an internal vertex in T has either one or two children; equivalently, T 
is a subdivision of a rooted binary tree T' over the same leaves as N. Therefore, 
T' has n — 1 internal vertices (including its root) of outdegree 2 and there are 
2n — 2 paths Pi {1 < i < 2n — 2) satisfying (i) the ends of each Pi are either the 
root of T, a leaf, or internal vertices of outdegree 2, and (ii) each internal vertex 
of Pi has both indegree and outdegree 1 if Pi consists of two or more branches. 


For each path Pi of length > 2, an internal vertex of Pi is either a tree vertex 
of N, whose outgoing branch not in Pi has been removed, or a reticulation, whose 
incoming branch not in Pi has been removed. For convenience of discussion, we 
divide the removed reticulation branches into cross and non-cross branches 
(with respect to T) (Figure [2j\). A removed branch is called a cross branch if 
its tail and head are located on two different paths Pi and Pj, i ^ j, otherwise 
it’s called a non-cross branch. We first have the following facts. 

Facts 

(1) If (x,y) is a non-cross branch removed from Pi, then at least one cross 
branch has been removed from its tree vertex tail in the segment Pi[x,y\ 
from X to y of Pi, and there is no reticulation in Pi[x,y\ other than y. 

(2) Let (x,y) and {x',y') be two non-cross branches removed from Pi, where 
y is an ancestor of y'. Then there exists at least one cross branch being 
removed from its tree vertex tail located between y and y' (Figure]^). 

(3) There are at least as many cross reticulation branches removed as non¬ 
cross reticulation branches. 

Proof. (1) Since N contains no parallel branches, Pi[x,y] has at least three 
vertices, so it suffices to prove that y is the only reticulation in Pi[x,y\. 
Assume on the contrary that a branch {z', z) has been removed from a retic¬ 
ulation z in Pi[x,y\ (Figure]^). Then there is a path including {x,y) from 
rjq to a leaf below y that avoids z, so z is not stable on any leaf below y 
(and hence below z) in T (and hence in N). Moreover, since T is a sub¬ 
tree of N, z cannot be stable in N on any leaf that is not below z in T. iV 
and T have the same leaf set, hence z is not stable in N, contradicting the 
reticulation-visibility property. 

(2) Note that y and y' are reticulations in N. By Fact (1) above, y must be 
above x', and there is a cross brach removed from its tree vertex tail located 
between x' and y'. 

(3) By Facts (1) and (2), we can establish an injective map from the set of 

non-cross reticulation branches to that of cross ones. Hence, the statement 
in this part is also true. □ 

Assume at least 2n — 1 cross branches {ti, ri) have been removed from the 2n—2 
paths Pi. At least two heads Vj and Vk are on the same path Pi (Figure]^). 
Using an argument similar to that used in the proof of Fact (2), one of rj and r/j 
which is upstream in Pi is not stable, a contradiction. Therefore, at most 2n — 2 
cross branches have been removed to produce T. By Fact (3), there are also at 
most 2n — 2 non-cross branches removed during the process. Since we removed 
one incoming branch for each reticulation, we conclude that there are at most 
4(n — 1) reticulations in N. □ 

Lemma 2. Let N he a binary nearly stable network, and let UretiN) (resp. 
Sret{N)) denote the number of all unstable (resp. stable) reticulations in N. We 
can transform N into a binary reticulation-visible network N' with the property 
that N' has the same leaf set as N and Sreti^) ^ 5'rei(-^0 ^ Siret{N) + Uret(,dI)- 


Proof. Let a be an unstable reticulation in N, whose child is denoted by b. Since 
N is nearly stable, b is stable. By Proposition [^2), & is a stable reticulation. Let 
c denote a parent of a; then c is stable by definition of N, and it is a tree vertex 
by Proposition 1^2). Let d denote the other child of c. Since c is stable, d is a 
tree vertex (Proposition [^3)). In addition, d is stable. 

Assume on the contrary that d is unstable. Then both its children must 
be stable by the nearly-stable property of N. Hence, by Proposition [^2) and 
the fact that d is unstable, both its children are stable reticulations. Since a is 
unstable, a is not a child of d. This implies that c is unstable, a contradiction. 

Finally, let e be the parent of c. / be the other parent of a and g be the 
other parent of b (see Figure [^. Note that g f. Otherwise, / is unstable, 
contradicting that there are no two consecutive unstable vertices. To transform 
N into a binary reticulation-visible network, we remove unstable vertex a by hrst 
removing the branch (c, a), and then contracting the paths f-a-b and e-c-d into 
branches (/, 6) and (e, d). Both b and d are clearly still stable in the resulting 
network. By rewiring around every unstable reticulation in N, we produce a 
binary reticulation-visible network N\ The inequality follows from the fact that 
no stable reticulation is removed, and no new reticulation is created during the 
rewiring. □ 
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Fig. 3. A An unstable reticulation a, its stable child b and its stable parents (c and 
/) in the original network N. To transform N into a reticulation-visible network, we 
remove the incoming reticulation branch (c, a) (B) and then contract paths e-c-d and 
f-a-b (C). The rewiring eliminates the unstable reticulation vertex a. 


Lemma 3. For a binary nearly stable network N, Uret{N) < 2Sret{d^)- 

Proof. Directly follows from the fact that an unstable reticulation must have a 
stable reticulation as its child, and any stable reticulation can be the child of at 
most two unstable reticulations. □ 

Theorem 2. Let N be a binary nearly stable network with n leaves. Let T(N) 
denotes the number of tree vertices in N. Then: 


(i) N has at most 12(n — 1) reticulations; 

(it) |r(A)| < 13(n- 1) and |F;(A^)| < 38(n- 1). 



Proof, (i) Theorem[^and Lemmas[^and[^imply 5'rei(-/V)+t/ret(-^) < ^Sret{N) < 
3Sret{N') < 3(4n - 4) = 12(n - 1). 

(ii) We can think of the network as a flow network, with as source and the n 
leaves as sinks. Hence, the number of tree vertices equals n — 1 plus the number 
of reticulations, that is, at most 13(n — 1) (by (i)). Since the outdegree of the 
root is two, and the outdegrees of each tree and reticulation vertex are 2 and 1, 
respectively, N has 2(13n — 13) + 12(n — 1) = 38(n — 1) branches at most. □ 


4 A Quadratic-Time Algorithm for the TCP 

In this section, we shall present a quadratic-time algorithm for solving the TCP. 
If a given network N and a given reference tree T contain a common subphy- 
logeny, then we can simplify the task of determining whether N displays T by 
replacing the common subphylogeny by a new leaf. Therefore, without loss of 
generality, we assume that N does not contain a subphylogeny with two or more 
leaves. We call this property the subphylogeny-free property. 



Fig. 4. All ten possible subnetworks at the end of a longest path in a nearly stable 
network. Here, r is the network root and the directed path from r to is represented 
by a coiled path. The parent w of u is not shown in C. 


Lemma 4. Let N be a nearly stable phylogenetic network satisfying the subphylogeny- 
free property. Let P = {r,... ,w,u,v,£) be a longest root-to-leaf path of four or 



more vertices in N, where r = r^ and I the leaf end. Then the subnetwork con¬ 
sisting of the descendants of w exhibits one of the structures given in Figure 

Proof. Note that v cannot be a tree vertex: since P is a longest root-to-leaf 
path, the other child of v would otherwise be a leaf, thereby contradicting our 
assumption that N satisfies the subphylogeny-free property. Therefore, u is a 
reticulation. There are two possible cases for u. 

1. The rt is a reticulation: Then u is unstable, and w must be a stable tree 
vertex (see Proposition [^2) for both claims), which is stable on t or some 
other leaf. Let g be the other child of w. By Proposition [^3), g is either a 
tree vertex or a leaf. If g is a leaf, we obtain the subnetwork in Figure [^. 
If g is a tree vertex, then neither of its children is a tree vertex: since P is 
a longest path, a tree vertex child of g would have two leaves as children, 
thereby contradicting the subphylogeny-free property. Note that g's children 
cannot both be reticulations either, since otherwise w would be unstable. 
Therefore, one child of g is a leaf and the other is a reticulation with a leaf 
child (again because P is a longest path), as shown in Figure]^. 

2. The M is a tree vertex: Let e denote the other child of u. Note that e 
cannot be a tree vertex, otherwise both its children would be leaves (since 
P is a longest path), which would contradict our assumption that N has the 
subphylogeny-free property. If e is a leaf, we obtain the subnetwork shown 
in Figure |4p. If e is a reticulation, then its only child is a leaf (again because 
P is a longest path), so e is stable on that leaf and u is therefore unstable. 
Since N is nearly stable, w must be a stable tree vertex. We consider the 
other child of ic in the following subcases. 

(2.1) If g is a leaf, then we have the subnetwork given in Figure]^. 

(2.2) If g is a tree vertex and also a parent of e and v, then we obtain the 
subnetwork in Figure]^. 

(2.3) If g is a tree vertex and in addition, g is a parent of e, but not a parent 
of V. then w is stable on P, the unique child of e. Let h be the other 
child of g] then h cannot be a tree vertex, since both its children would 
then be leaves, which would contradict our assumption that N has the 
subphylogeny-free property. If h is a reticulation, its child must be a 
leaf, since P is a longest path. Thus, we have the subnetwork given in 
Figure]^. If h is a leaf, we obtain the subnetwork in Figure |4]GI. 

(2.4) If g is a tree vertex and in addition, g is a parent of v, but not a parent 
of e, then a discussion similar to that of case (2.2) characterises the 
only two possible subnetworks (Figure]^ and[^) in this case. 

(2.5) If is a tree vertex and in addition, g is neither a parent of v nor 
a parent of e: then again we look at g’s children. Both cannot be 
reticulations, otherwise w is unstable, a contradiction. If neither of 
them is a reticulation, then there is a subtree below g; if one of them 
is a reticulation and the other is a tree vertex, then again there is a 
subtree. The only possible case that remains, shown in Figure |^, is 
the case where one child is a reticulation and the other is a leaf. 


(2.6) If g is a reticulation: Then w unstable. This is impossible, as w is a 
stable tree vertex. □ 

The subnetwork below g of the structures shown in Figure 153 , |4p, 0, If 
and that below u in Figure |f match the following pattern: 



in which a leaf i has a reticulation sibling y and a leaf nephew, i'. Such a pattern 
is called an uncle-nephew structure. Note that if (. and t' are not siblings in a 
tree displayed by N, then the reticulation branch (x, y) should not be used. If 
I and I' are siblings, either (x, y') or the other branch entering y can be used. 
Here, since the other branch enters y from an unspecified vertex, it is simply 
called a dangling branch. It is not hard to see that for a tree T in which ^ 
and are siblings, if T is displayed in the network resulting from the removal 
of {x,y), it is also displayed in the one after the dangling branch is removed. 
Hence, to determine whether N displays a tree T, we can simplify the network 
by eliminating y using the following process: 

Uncle-Nephew Reduction In an uncle-nephew substructure shown 
above, remove the dangling branch if £ and £' are siblings in T, or remove 
(x, 2 /) otherwise. Then contract vertices with indegree and outdegree 1. 

In each of the other cases, we can also simplify the network by using infor¬ 
mation on the input tree. To summarize how to simplify the network, we use the 
following notation for each vertex w in a network N\ 

— R{w) denotes the subnetwork consisting of all the descendants of w, 

— (—,x) denotes the dangling branch entering x from its parent not in R(w) 
for X in R{w); 

— N' {x,y) denotes the subnetwork obtained by adding [x,y) into N' for a 
subnetwork N' of N and a branch {x,y) of N] 

— N' — (x, y) denotes the subnetwork obtained by removing (x, y) from N' for 
a subnetwork N' of fV; 

— pt{x) denotes the parent of a vertex x in a tree T. 

Theorem 3. Let N be a binary nearly stable network with no uncle-nephew 
structure, and T a tree with the same set of labeled leaves. Let w be a tree vertex 
in N. Define N' as follows. 

(i) When R{w) matches the structure of Figure^i, define N' = N — {w,u) if £ 
and £' are not sibling in T and N' = TV — {(—, u), (—, u)} otherwise. 


(ii) 


(iii) 

(iv) 

(v) 


When R{w) matches the structure of Figure define N' = N — {—,v) 
when i and I" are siblings, or when I and I' are siblings and their parent is 
a sibling of i" in T, and N' = N — {u,v) otherwise. 


When R{w) matches the structure of Figure 
When R{w) matches the structure of Figure 
if £ and £' are siblings in T and N' = N — (u 


4 E, define N' = N—\{u, e), (g, t))}. 
Jf, define N' = N-{{g,e),{-,v)} 
otherwise. 


When R{w) matches the structure of Figure^^, define N' = N—{{g, v), (—, e)} 
if i and I' are siblings in T and N' = N — {u, v) otherwise. 


Then N' is nearly stable and N displays T only if N' displays T. 


Proof. Since none of the simplifications removes any leaf and all of them only 
reduce possible paths from rjv to a leaf, the resulting network N' is nearly stable. 

Assume R{w) is the subnetwork in Figure]^ and N displays T. Then there 
exists a subtree T' of N that is a subdivision of T and let pt{£) corresponds x 
in T'. Clearly, x is of degree 3 and hence a tree vertex in N. We consider two 
cases. 

CASE A. Leaves I and £! are not siblings in T. 

We first have that x u, x v iox u and v in Figure We also have 
X w. Otherwise, £' must be a child of x in T' and ^ is a sibling of £' in T, a 
contradiction. Therefore, the path from xXo i contains two or more vertices and 
V is the parent of £ in this path. If u is the parent of v in the same path, neither 
(—, v) nor {w, u) is in T', indicating that N' = N — {w, u) also displays T. 

If pt> (v) ^ u in the same path, then (u, v) is not in T' and hence u becomes 
a dummy leaf in T', as there is no leaf other than £ below u in R{w). If (w,u) 
is in T', then (—,m) is not in T' and T' + (—,u) — {w,u) is a subtree of N' in 
which only the dummy leaf u is relocated. Hence, N' also displays T. 

CASE B. Leaves i and £' are siblings in T. 

Then x is a common ancestor of £ and £' in A^. If x = w, the path from x to 
£ in T' must be w, u, v, as this is only path from re to £ in N. Hence, (—, u) and 
(—, v) are not in T'. Therefore, T' is a subtree of N' and N' also displays T. 

li X w, then x is an ancestor of w and hence w is the parent of £' in the 
path from x to £' in T'. Note that Pt'{£) = v. lipT'{v) = u, then (—,u) is in T', 
but both (—, v) and {w, u) are not. T" = T' + {w, u) — (—, u) is a subtree of N'. 
Noting that T” is also a subdivision of T, N' displays T. 

If pt'{v) 7 ^ M, then (—,u) is in the path from x to £ in T'. This implies 
that {u,v) is not in T' and it is a dead-end in T'. If (w,u) is in T', the subtree 
T" = T' + (it, v) — (—, v) of N' is a subdivision of T. If (ic, it) is not in T', the 
subtree T" = T' + {w,u) — (—,ii) — (—,u) of N' is a subtree of N'. Hence, N' 
displays T'. 

Similarly, we can prove that N displays T only if N' displays T when R{w) 
is the subnetwork in the panels D, F, and H in Figure Note also that the 
subnetworks in the panels F and FI are essentially identical (if the positions of v 
and e are switched). Due to the limited space, the details are omitted here. The 
case when R{w) is the subnetwork in Figure is trivial, as deletion of which 
two reticulation branches from v and e does not affect outcome. □ 




By Theoremj^ we are able to determine whether a nearly stable phylogenetic 
network N displays a binary tree T or not by repeatedly executing the following 
tasks in turn until the resulting network N' becomes a tree: 

— Compute a longest path P in N' = N; 

— Simplify N' by considering the subnetwork at the end of P according to the 
cases in Lemma 

— Contract degenerated reticulations in N' and replace the parent of a pair of 
leaves appearing in both N' and T with a new leaf. 

and then check if N' is identical to T. 

Finally, we analyze the time complexity. Let N and T have n leaves. By 
Theorem]^ there are 0{n) vertices and 0{n) branches in N. Since we eliminate 
at least a reticulation in each loop step, the algorithm stops after 0{n) loop steps. 
In each loop step, a longest path can be computed in 0{n) time ([E], page 661), 
as N is acyclic; both the second and third tasks can be done in constant time. 
In summary, our algorithm has quadratic time complexity. 


5 Conclusion 

We have developed a quadratic-time algorithm for the TCP for binary nearly 
stable phylogenetic networks. Our algorithm not only is applicable to a superclass 
of tree-child networks, but also has a lower time complexity than the algorithm 
reported in [7] . Although phylogenetic network models built in the study of viral 
and plant evolution are often nearly stable, it is interesting to know whether the 
TCP is polynomial time solvable or not for networks with other weak properties. 

In particular, the problem remains open for binary networks with the visi¬ 
bility property, but the upper bound we have presented on the number of retic¬ 
ulation vertices of such networks, as well as our algorithm for nearly stable 
phylogenetic networks, provide definitely valuable ideas to solve the problem, 
exactly or heuristically, on phylogenetic networks with the reticulation visibility 
property. 

6 Acknowledgments 

The project was financially supported by Merlion Programme 2013. 


References 

1. Bondy, J.A., Murty, U.S.R.: Graph Theory. Springer (2008) 

2. Cardona, G., Rossello, F., Valiente, G.: Comparison of tree-child phylogenetic net¬ 
works. lEEE/ACM Trans. Comput. Biol. Bioinfo. 6(4), 552-569 (2009) 

3. Chan, J.M., Carlsson, G., Rabadan, R.: Topology of viral evolution. PNAS 110(46), 
18566-18571 (2013) 


4. Dagan, T., Artzy-Randmp, Y., Martin, W.: Modular networks and cumulative 
impact of lateral transfer in prokaryote genome evolution. PNAS 105(29), 10039- 
10044 (2008) 

5. Gusfield, D.: ReCombinatorics: The Algorithmics of Ancestral Recombination 
Graphs and Explicit Phylogenetic Networks. The MIT Press (2014) 

6. Huson, D.H., Rupp, R., Scornavacca, C.: Phylogenetic Networks: Concepts, Algo¬ 
rithms and Applications. Cambridge University Press (2011) 

7. van lersel, L., Semple, C., Steel, M.: Locating a tree in a phylogenetic network. 
Inf. Process. Lett. 110(23), 1037-1043 (2010) 

8. Jenkins, P., Song, Y., Brem, R.: Genealogy-based methods for inference of historical 
recombination and gene flow and their application in saccharomyces cerevisiae. 
PLoS ONE 7(11), e46947 (2012) 

9. Kanj, LA., Nakhleh, L., Than, C., Xia, G.: Seeing the trees and their branches in 
the network is hard. Theor. Comput. Sci. 401, 153-164 (2008) 

10. Marcussen, T., Jakobsen, K.S., Danihelka, J., Ballard, H.E., Blaxland, K., Bryst- 
ing, A.K., Oxelman, B.: Inferring species networks from gene trees in high-polyploid 
north american and hawaiian violets (viola, violaceae). Syst. Biol. 61, 107-126 
( 2012 ) 

11. McBreen, K., Lockhart, P.J.: Reconstructing reticnlate evolutionary histories of 
plants. Trends Plant Sci. 11(8), 103-122 (2006) 

12. Moret, B.M.E., Nakhleh, L., Warnow, T., Linder, C.R., Tholse, A., Padolina, A., 
Sun, J., Timme, R.: Phylogenetic networks: Modeling, reconstructibility, and ac¬ 
curacy. lEEE/ACM Trans. Comput. Biol. Bioinfo. 1(1), 13-23 (2004) 

13. Nakhleh, L.: Computational approaches to species phylogeny inference and gene 
tree reconciliation. Trends Ecol. Evolut. 28(12), 719-728 (2013) 

14. Parida, L. : Ancestral recombinations graph: a reconstructability perspective using 
random-graphs framework. J. Comput. Biol. 17(10), 1345-1370 (2010) 

15. Sedgewick, R., Wayne, K.: Algorithms, 4th Edition. Addison-Wesley (2011) 

16. Treangen, T.J., Rocha, E.P.: Horizontal transfer, not duplication, drives the ex¬ 
pansion of protein families in prokaryotes. PLoS Genetics 7(1), el001284 (2011) 

17. Wang, L., Zhang, K., Zhang, L.: Perfect phylogenetic networks with recombination. 
J. Comp. Biol. 8(1), 69-78 (2001) 



